Cumulon: Matrix-Based Data Analytics in the Cloud with Spot Instances

Authors

  • Botong Huang
  • Nicholas W. D. Jarrett
  • Shivnath Babu
  • Sayan Mukherjee
  • Jun Yang
Abstract

We describe Cümülön, a system aimed at helping users develop and deploy matrix-based data analysis programs in a public cloud. A key feature of Cümülön is its end-to-end support for the so-called spot instances—machines whose market price fluctuates over time but is usually much lower than the regular fixed price. A user sets a bid price when acquiring spot instances, and loses them as soon as the market price exceeds the bid price. While spot instances can potentially save cost, they are difficult to use effectively, and run the risk of not finishing work while costing more. Cümülön provides a highly elastic computation and storage engine on top of spot instances, and offers automatic cost-based optimization of execution, deployment, and bidding strategies. Cümülön further quantifies how the uncertainty in the market price translates into the cost uncertainty of its recommendations, and allows users to specify their risk tolerance as an optimization constraint.
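The bidding rule described in the abstract can be illustrated with a small sketch. This is not Cümülön's optimizer. It is a hypothetical simulation that assumes an hourly price trace, assumes the user pays the market price (in integer cents) for each hour the instance is held, and assumes the instance is lost for good once the market price exceeds the bid. The function name, the trace values, and the billing model are all illustrative assumptions.

```python
def simulate_spot_run(prices_cents, bid_cents, hours_needed):
    """Walk an hourly price trace and return (hours_completed, cost_cents, finished).

    Assumptions (not from the paper): one price per hour, the user pays the
    market price for each hour held, and the instance is lost permanently
    as soon as the market price exceeds the bid.
    """
    cost = 0
    hours_done = 0
    for price in prices_cents:
        if hours_done == hours_needed:
            break  # work finished; stop paying
        if price > bid_cents:
            break  # market price exceeded the bid: instance is lost
        cost += price
        hours_done += 1
    return hours_done, cost, hours_done == hours_needed


# Hypothetical hourly market prices in cents, with a spike in the fourth hour.
trace = [10, 12, 11, 30, 12]

# A low bid is cheap per hour but loses the instance at the spike.
print(simulate_spot_run(trace, bid_cents=15, hours_needed=5))
# A higher bid survives the spike and finishes, at a higher total cost.
print(simulate_spot_run(trace, bid_cents=35, hours_needed=5))
```

Under these assumptions the low bid pays for three hours without finishing, which mirrors the risk the abstract mentions of not completing the work while still incurring cost.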

Similar resources

Spotlytics: How to Use Cloud Market Places for Analytics?

In contrast to fixed-priced cloud computing services, Amazon’s Spot market uses a demand-driven pricing model for renting out virtual machine instances. This allows for remarkable savings when used intelligently. However, a peculiarity of Amazon’s Spot market is that machines can suddenly be taken away from the user if the price on the market increases. This can be considered as a distinct for...

Cumulon: Cloud-Based Statistical Analysis from Users Perspective

Cumulon is a system aimed at simplifying the development and deployment of statistical analysis of big data on public clouds. Cumulon allows users to program in their familiar language of matrices and linear algebra, without worrying about how to map data and computation to specific hardware and software platforms. Given user-specified requirements in terms of time, money, and risk tolerance, C...

Application of Big Data Analytics in Power Distribution Network

Smart grid enhances optimization in generation, distribution and consumption of the electricity by integrating information and communication technologies into the grid. Today, utilities are moving towards smart grid applications, most common one being deployment of smart meters in advanced metering infrastructure, and the first technical challenge they face is the huge volume of data generated ...

Analysis and Prediction of Amazon EC2 Spot Instance Prices

Amazon Elastic Compute Cloud (EC2) is a web service that provides secure, resizable compute capacity in the cloud. It is designed to make web-scale cloud computing easier for developers. There are three different ways of pricing for Amazon EC2 instances: On-demand, Reserved instances, and Spot instances. Spot instances provide users with access to unused Amazon EC2 capacity at high discounts relative...

Using Spot Instance SLAs for Reliable Cloud Federation

Spot instances are commonly offered by IaaS cloud providers to opportunistically utilize spare capacity and meet temporary user demand for additional resources at low cost. Although the availability of service SLAs is a core paradigm of cloud computing, spot instances typically come without any service quality guarantees. We aim to extend the spot instance service to provide SLAs for eviction...

Journal:
  • PVLDB

Volume 9  Issue 

Pages  -

Publication date 2015